Directly Observing Momentum Transfer in Twin-Slit "Which- Way" Experiments 
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Is the destruction of interference by a which-way measurement due to a random momentum 
transfer p > h/s, with s the slit separation? The weak-valued probability distribution P W v(p), 
which is directly observable, provides a subtle answer. P wv (p) cannot have support on the interval 
[—h/s, h/s]. Nevertheless, its moments can be identically zero. 
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I. INTRODUCTION 

Making a position measurement to determine through 
which slit a particle has passed necessarily destroys the 
twin-slit interference pattern (see Fig. 1). This is the 
canonical example of Bohr's complementarity principle 
To defend this principle against Einstein's recoiling 
slit gedankenexperiment, Bohr relied upon the recently 
(in 1927) derived Heisenberg uncertainty relation Q to 
show that the position measurement would cause an "un- 
controllable change in the momentum" p > H/s, where s 
is the slit separation p|. This is just what is required to 
wash out the interference pattern, thereby enforcing com- 
plementarity. Bohr's argument was famously reiterated 
by Feynman for a measurement using Heisenberg's 
light microscope Q. 

In 1991 Scully, Englert and Walther |5j proposed a 
new which-way (welcher Weg) scheme for which, they 
calculated, no momentum would be transferred to the 
particle. Thus, they concluded, the arguments of Bohr 
and Feynman were wrong in general, and that comple- 
mentarity must be deeper than uncertainty. Storey, Tan, 
Collett and Walls (STCW) disagreed, proving a the- 
orem that if interference is destroyed then there is some 
transverse momentum transfer > H/s. The debate 0,0 
was partially resolved when it was pointed out that 
SEW and STCW were using different concepts of "mo- 
mentum transfer" , and that their analyses were in fact 
complementary. Aspects of both are found in two further 
characterizations of momentum transfer that have since 
been proposed 0, . However, these characterizations 
are not obtainable from experiments directly (i.e. in a 
way that would make sense to a classical physicist). 

In this Letter I propose a new and attractive resolution: 
a way to directly observe a single probability distribution 
P(p) for momentum transfer p in a which-way measure- 
ment (WWM). My method shows that SEW were right, 
in that the variance of P(p) is zero for their scheme, 
but that STCW were also right, in that P(p) cannot 
have support on the interval [—H/s, H/s]. This is possible 



only because P(p) must be measured as a weak valued 
|14j probability distribution, and hence may take nega- 
tive values. 

The body of this Letter is organized as follows. First I 
introduce the general formalism of WWMs, and use this 
to explain the four characterizations of momentum trans- 
fer proposed in Refs. |M la. 1 1 (A lllj . I discuss these char- 
acterizations in terms of 6 key properties, showing that 
each lacks at least 2 properties. This motivates the intro- 
duction of the present approach, based on weak values, 
which has all properties except non-negativity of P(p). 



II. WHICH-WAY MEASUREMENTS 

Say the two slits in the Young's interferometer are cen- 
tred at x = ±s/2. Note that it is necessary to restrict 
the discussion to an interferometer of this kind, where 
the initial superposition is in transverse position and the 
free evolution preserves the conjugate quantity (trans- 
verse momentum), so that the issue of loss of visibility 
related directly to transverse momentum transfer. Anal- 
ogous issues may be identifiable for other sorts of inter- 
ferometer, but that is beyond the scope of this Letter. 

A generalized measurement of x transforms the initial 
state produced by the twin slits, \ipi), into a final mixed 
state, pf, according to [rilTol] 



(2.1) 



'Electronic address: H.Wiseman@grifEth.edu.au 



Here {£} is the set of measurement results, which here 
is summed over, as I am using the Einstein summation 
convention. This gives the final state averaged over all 
possible results, because 0^\ipi) represents the unnormal- 
ized state given the result £, and its modulus squared is 
the probability to obtain that result £. 

Note that there are infinitely many sets of operators 
{Of} which give the same average transformation l|2.1|) 
|12| . These different sets can be obtained physically by 
using the same measurement interaction between the sys- 
tem and the apparatus, but from resolving this apparatus 
in different bases. For example, for many measurements 
there is a "quantum eraser" basis |13| complementary to 
the basis that gives the best which-way information. In 
these different bases, the states O^ipi) conditioned on 
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individual results may be completely different, but the 
average state pf is the same. 

For Eq. (|2.1|) to describe a position measurement, it is 
necessary to restrict the operators Oj to be functions of 
the position operator x. Then in the position represen- 
tation one finds 

(x\6t\rl>i) = O t (x)iPi(x). (2.2) 

Thus the measurement is defined by the functions 
{0^(x)}, which are constrained mathematically only by 
the completeness condition 

Vx Ot(x)0%(x) = I (2.3) 

which ensures that pf is normalized. For narrow slits, 
[i.e. \ip{x)\ 2 ~ 5(2x + s) + S(2x - s)], the visibility of 
the far field interference pattern can be shown to be 
given by 

V = \0 6 (-s/2)OI(s/2)\. (2.4) 

It is evident from Eq. I|2.2|l that if all of the 0^(x) 
are flat in the region of the slits, then a single slit ini- 
tial wavefunction ipi(x) will emerge unchanged from the 
measurement region. In particular, its momentum dis- 
tribution (the diffraction pattern) will be no wider than 
that with no measurement. This phenomenon is compat- 
ible with the complete loss of visibility (|2.4I) , and indeed 
would occur for the micromaser experiment proposed by 
SEW. It is by this directly observable measure that SEW 
claimed there would be no momentum transfer. 

In response, STCW proved that Eqs. I|2.3|l and (|2.4() 
imply that for at least one £, the Fourier transform 

O c (p) = (2irh)- 1/2 JdxO^(x)e^ xp/h (2.5) 

does not have support on the interval (—h/s,h/s) if 
V = 0. The significance of this is that in the momen- 
tum representation Eq. I|2.2I) becomes 

(p|0 € hfc>= fdp6t(p)$i(p-p). (2.6) 

This implies that 0^{p) can be interpreted as a mo- 
mentum transfer amplitude distribution. Moreover, from 
Eq. H2.(jf> . if the initial state \tpi) were ap = momentum 
eigenstate then the final momentum distribution would 
be (p\pf\p) — 0^(p)0^(p). Thus the momentum transfer 
in the STCW theorem is also directly observable, as the 
creation of momentum components greater than or equal 
to h/s. 

For the Einstein recoiling slit [1| and Feynman light 
microscope it turns out that 

(P\PJ\P) = J d P ^classical (p) | V>i (p ~ p)\ 2 (2.7) 

where -P c iassicai(p) is non- negative. For such schemes 
the 0^{x) can be chosen as J exp(-ik^x), and 



Classical (p) = S e N^S(p - /%). For these schemes, the 
final momentum distribution is broadened identically for 
all initial states, and the SEW and STCW characteri- 
zations of momentum transfer agree. However, for some 
schemes (most notably that of SEW) the functions (x) 
are necessarily not of this form, which is why more than 
one characterization of momentum transfer is possible in 
general. These facts were first pointed out by Wiseman 
and Harrison [|J. 

The essential problem is that if the momentum trans- 
fer (|2.(il) is not of classical form 1)2. 7fl . then the transfer of 
momentum is not clearly defined for an initial twin-slit 
state, as it does not have definite momentum. The ob- 
vious solution is to use a quantum formalism in which a 
momentum is associated with the particle even when it 
is not in a momentum eigenstate. Two such formalisms 
have been investigated in the past. 

In the Wigner function formalism [lOj, the particle is 
described by a pseudo probability distribution on phase 
space W{x,p) which gives the correct marginal distribu- 
tions for x and p. The transformation Eq. I|2.1|l becomes 

W f (p,x) = JdpW l {x,p- p)P wigncl (p;x). (2.8) 

Here fwigncr(p;^) is defined in terms of {O^} and acts 
formally as an cc-dependent momentum transfer proba- 
bility distribution. Both the local momentum transfer at 
x = ±s/2, and the nonlocal momentum transfer at x = 
(midway between the slits) are relevant to the final mo- 
mentum distribution. 

In the Bohmian formalism an individual parti- 
cle has a definite position x and momentum p = mx = 
Re[— ihip'{x)/tf){x)]. The probability distribution for x is 
the usual \i/j(x)\ 2 , but that for p equals \ip(p)\ 2 only in 
the far field By following the trajectories of individ- 
ual particles one can calculate a time-dependent momen- 
tum transfer probability distribution i-Bohm(p;£) where 
t is the time after the WWM [HI In this formalism 
-PBohm(p, + ) gives the local momentum transfer, but the 
momentum continues to change after the measurement 
(i.e. nonlocally), until the far field (t oo). 

III. PROPERTIES OF 
p-CHARACTERIZATIONS 

By considering the above four characterizations it is 
possible to identify certain key desirable properties. They 
are desirable in the sense that classically they would be 
properties of a complete description of momentum trans- 
fer. They are key in the sense that all are properties of 
some of the above characterizations, but none are prop- 
erties of all. The details are given in table 1. 

The first property is that there should be a probability 
distribution P(p) in order to fully characterize p. This 
is as opposed to partial characterizations, such as the 
(lack of) increase in the width of the single-slit momen- 
tum distribution (SEW), or the creation of momentum 
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TABLE I: Table of properties of various proposed methods of characterizing the momentum transfer in a WWM. A or x 
indicates the presence or absence of a property, and a — indicates non-applicability. 
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values outside a certain range from a p = momentum 
eigenstate (STCW). The second property is that P(p), 
if it exists, should be non-negative. 

The third property is that p should reflect any change 
in the momentum distribution. Clearly in a which-way 
scheme the momentum distribution does change (the 
fringes disappear) so if p is characterized by a distri- 
bution, it should not equal 8{p). The fourth property is 
that p should reflect the change in the moments of the 
momentum. If the single-slit diffraction patterns are un- 
affected by the WWM, as in the scheme of SEW, then 
it turns out |lf)j | that the moments of the twin-slit in- 
terference pattern are also exactly unchanged. For such 
schemes, the moments of p should be zero also. 

The fifth property is that the characterization of p 
should be directly observable, in the sense defined in the 
introduction. In particular, no knowledge of quantum 
mechanics (such as the wave properties of particles, or 
even the value of K) should be required for understanding 
this part of the experiment, or for processing the data. 

The sixth and final property is that the character- 
ization should not depend upon the basis into which 
the measurement apparatus is resolved. That is, for a 
given measurement interaction, giving the transforma- 
tion 1)2.1(1 . the characterization should not depend upon 
the particular set of functions {O^} arising from resolv- 
ing the apparatus in a particular basis (as discussed in 
Sec. II). For example, the momentum transfer should be 
the same in the quantum eraser basis [13J as in the which- 
way basis. 



IV. WEAK VALUES 

The proposal in this Letter is to characterize p us- 
ing the theory of weak values 0]. A weak value is a 
the mean value of the result of a weak measurement of 
some quantity A. This is a measurement which yields 
an arbitrarily small amount of information about A, and 
disturbs the system correspondingly weakly in doing so. 
This means that the ensemble giving the average must be 
correspondingly larger than that which would be needed 
for averaging a strong (projective) measurement. 

Weak values are interesting (for example, lying out- 



side the range of eigenvalues of A in the case when 
they are post-selected on the obtaining of a certain re- 
sult from a projective measurement at a later time. If 
the initial state is \ipi) and the projector for the desired 
final measurement result is \<f>f)(<f>f\, then the weak value 
of A conditioned on this final result is 

^|(^)|^>- Re {(f>flA) ■ 

This prediction was soon verified experimentally |16| . 
Since then post-selected weak values have found many 
applications, including defining tunneling time in a di- 
rectly observable manner [T?). and resolving quantum 
paradoxes 0|- With a few simple generalizations, weak 
values have also been found |l9| to explain puzzling as- 
pects of a well-known cavity QED experiment [20] • 

Given these successes of weak value theory, it is natu- 
ral to apply it to the question of momentum transfer in 
WWMs. A schematic experiment is shown in Fig. 1. 
After the initial state \ipi) is formed, but before the 
WWM, a weak measurement of the projector is 
made. If this were a strong measurement then the mean 
value (whether post-selected or not) would be \ ipi\tpi)\ 2 , 
the probability for the initial momentum to be pi. For 
a weak measurement, an individual result is given by 
KftlV'i)! 2 + where S is a standard normal variable 
and a 2 is an arbitrarily large parameter (equal to 1 / dt in 
Ref. ^3)- The quantum back-action of this measurement 
on the system state is, to leading order in a , 

\i/>i) - [1 + (\ Pi )(pi\ - \(p^i)\ 2 ) SI 2a] |^>. (4.2) 

Following the weak measurement, there is the WWM 
represented by the operators {0{\ in Eq. (|2.1I) . The 
post-selecting measurement is a strong measurement of 
the final momentum pf. The weak measurement and fi- 
nal measurement can be constructed as appropriate for 
a classical particle. Thus, the post-selected weak value 
would classically be interpreted as P(pi\pf). Since free 
evolution preserves momentum, if there were no WWM 
then one would expect that P(pi\pf) = 5{pi — Pf), and 
any deviation from this due to the WWM would be in- 
terpreted as a momentum transfer. 

Using the generalized theory of Ref. [19j , the weak val- 
ued conditional probability P W v{Pi\Pf) which would be 



4 



measured for a quantum system is 
M (\Pi){Pi\)\^) = R° 



(p/|o c |^)<^|o+| P/ ) 

(4-3) 

(Recall the summation convention.) Now the joint 
and conditional distributions are related by P(pi,pf) — 
P(pi\pf)P(j>f). Since the weak measurement introduces 
negligible disturbance to the system, the denominator in 
Eq. JO} is simply P(pf), so 



,(pi,p f ) =Re{( Pf \6 e \p i )(p i \i) i )(ii> i \6l\ Pf )}. (4.4) 



We can rewrite this as a function P wv (p,pi) of the mo- 
mentum transfer p — pf — pi. If one then repeated the 
experiment for all values of P i, one could integrate this 
over all pi to finally obtain the weak-valued probability 
distribution as 

Pvv(p) = Jdpi Re | ^ + p|O s \pi) 

x (Pi\i>i)(ipi\dl\pi + p}y (4.5) 

It is now straightforward to evaluate Eq. I|4.5|l using 
Eq. I|2.1|l . The resulting distribution for the momentum 
transfer caused by the WWM is remarkably elegant: 

P wv (p)=Re{6 s (p)Q|(p)}. (4.6) 

The Fourier transform is as defined in Eq. 12.5fl . and 



Q £ (x) = Ot(xMi(x)\ 2 



(4.7) 



Thus the probability distribution for p depends upon the 
initial state in a very natural way. Note that P wv (p) 
may be negative, although for classical momentum kicks 
Pwv(p) = -Pciassicai(p) > 0. If there is no WWM, so that 
0^{x) = 1, then P wv (p) = S(p) as one would wish. 



V. PROPERTIES OF P W v(p) 

From Eq. (|4.6|1 and Eq. 12.3fl a number of results are 
easily proven using the moment-generating function 



<%) = dpP wv (p)e l ^ h 



(5.1) 



= Re J dx\ipi(x)\ 2 $ (x)Ol(x - q). (5.2) 

First, P wv (p) is normalized, as $(0) = 1. Second, using 
the Schwartz inequality, |$(g)| < 1, as it would be for a 
true (non-negative) probability distribution. Third, since 
the moments of p are given by 



(p») wv = (-ihB/dq) n Hq)\ g = , 



(5.3) 



it follows from Eq. (|5.2|) that if the are flat (i.e. 
have all derivatives zero) in the region of the slits where 



\ipi(x)\ 2 is nonzero, then all of the moments of p are 
zero. Thus the claim of SEW, that their scheme would 
not transfer any momentum to the particle, could be val- 
idated experimentally by calculating the moments of p 
using the measured P wv (p). 

Fourth, despite this last fact, P wv (p) also reflects 
the change in the momentum distribution caused by a 
WWM. For narrow slits at x = ±s/2, 



1. 



-Re 



Of 



01 



o. 



(-1) 



o: 



(I) 

(5.4) 

Using Eq. I|2.3[) and Eq. (|2.4|l and the triangle inequality, 
it can be proven that |$(s)| < 1/2 for V = 0. Since in 
addition $(0) = 1 and \§(q)\ < 1 V<j, it follows from the 
theorem in Appendix A of Ref. 01 that 



Support [Pwv(p)] <t {-irh/3s,nh/3s) 



(5.5) 



This establishes the weaker result, quoted in the abstract 
and Sec. I, that the support of P wv (p) is not contained 
in [-h/s,h/s] [23. 

To illustrate the above results, consider the minimal 
WWM: a projective measurement of the sign of x, so 
that 0±(x) = Q(±x). For narrow slits, Eq. I|4.6|l gives 



Pwv(p) = - 



sin ps/2 



Tip 



(5.6) 



This clearly satisfies Eq. 1)5. 5|l . and yet has all moments 
equal to zero in a distributional sense For example, 
if the mathematical approximation of Gaussian slits of 
width a is used, then one obtains a distribution whose 
moments are well-defined in ordinary calculus, and which 
vanish exponentially quickly in s/a. 

As noted above, the property of having zero moments 
but not being a ^-function is possible only because of 
the fact that P wv (p) takes negative values. There is no 
logical contradiction here; P wv (p) is not the probability 
distribution for a measurement result. Rather, it is itself 
the average of (weak) measurement results. In classical 
mechanics, the P wv (p) so derived would be guaranteed to 
be the probability distribution for the momentum trans- 
fer p, as could be verified by repeatedly measuring pf 
and pi, and subtracting them. In quantum mechanics 
a measurement of pi would drastically disrupt the sys- 
tem, so we must be content with the first way of finding 
P wv (p), via weak values. Negative weak- valued probabil- 
ities, which have been previously encountered in Ref. 0] , 
are non-classical, but not nonsensical. 

In conclusion, I have shown that using weak measure- 
ments one can directly observe a (weak-valued) probabil- 
ity distribution P(p) for momentum transfer in which- 
way measurements. It has all the properties one would 
desire of such a distribution except that it may take nega- 
tive values. This is unavoidable since for the SEW scheme 
the distribution reflects both the fact that all of the mo- 
ments of p are zero, and the fact that P(p) ^ for some 
p outside the interval [— h/s, H/s]. Techniques similar to 
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those used in Ref. 16] should enable interesting instances Research Council, 
of this distribution, such as that in Eq. (|5.6|) . to be mea- 
sured with current technology. 
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FIG. 1: Diagram of a twin-slit which-way experiment. The 
initial state \tpi) is formed by the slits and propagates lon- 
gitudinally towards the final screen. This is in the far field, 
so detecting the position of the particle there is equivalent to 
measuring its final momentum pf. With the which-way mea- 
surement (WWM) device in place the distribution for pf is 
just the single-slit diffraction pattern. In its absence, a twin- 
slit interference pattern would form. Between the slits and the 
WWM device is a device which makes a weak measurement 
of \pi){pi\, as explained later in the text. 



